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Abstract 

The performance of a diffuser in a pump stage depends on its configuration and placement within the stage. 
The influence of vane shape on the hydrodynamic performance of a diffuser has been studied. The goal of 
this effort has been to improve the performance of a pump stage by optimizing the shape of the diffuser 
vanes. The shape of the vanes was defined using Bezier curves and circular arcs. Surrogate model based 
tools were used to identify regions of the vane that have a strong influence on its performance. 
Optimization of the vane shape, in the absence of manufacturing and stress constraints, led to a nearly nine 
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percent reduction in the total pressure losses compared to the baseline design by reducing the extent of the 
base separation. 

Keywords: Diffuser, Multiple surrogates, shape optimization, hydrodynamics 

1. Introduction 

The space shuttle main engine (SSME) is required to operate over a wide range of 
flow conditions. This requirement imposes numerous challenges on the design of 
turbomachinery components. One concept that is being explored is the use of an expander 
cycle for an upper stage engine. A schematic of a representative expander cycle for a 
conceptual upper stage engine is shown in Figured. Oxidizer and fuel pumps are used to 
feed LOX (liquid oxygen) and LH2 (liquid hydrogen) to the combustion chamber of the 
main engine. The combustion products are discharged from the nozzle. The pumps are 
driven by turbines, which use the gasified fuel as the working fluid. 

There is a continuing effort to develop subsystems like turbopumps and turbines used 
in a typical expander cycle based upper stage engine. The requirements on the design of 
subsystems are influenced by size, weight, efficiency, and manufacturability of the 
system. Besides, often the requirements for subsystems are coupled, for example, the 
above-mentioned design constraints require turbopumps to operate at high speeds to have 
high efficiency with low weight and compact design. This may require seeking alternate 
designs for the turbopumps as the current designs may not prove adequate over a wide 
range of operating conditions. Mack et al. [1, 2] optimized the design of a radial turbine 
that allows high turbopump speeds, performs comparable to an axial turbine at design 
conditions, and yields good performance at off-design conditions. 
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Domey et al. [3] have been exploring different concepts for turbopump design. A 
simplified schematic of a pump is shown in Figure 2. Oxidizer or fuel enters from the 
left. The pressure increases as the fluid passes through the impeller. The fluid emerging 
from the impeller periphery typically has high tangential velocity, which is partially 
converted into pressure by passing it over a diffuser. Domey et al. [3] found that a 
diffuser with vanes is more efficient than a vaneless diffuser at off-design conditions. 
Over a range of operating conditions, the performance of a pump is driven by the flow in 
the diffuser. Generally, the diffuser will stall before the impeller, and the performance of 
the diffuser will drop off more rapidly at off-design operating conditions. The main 
objectives of the current effort are to improve the hydrodynamic performance of a 
diffuser, using advanced optimization techniques, and to study the features of the diffuser 
vane that influences its performance. 

Improvement in performance via shape optimization has been successfully achieved 
in many areas. For example, there are numerous instances of improvement in lift to drag 
ratio via airfoil or wing shape design [4-10], shape of blades is optimized to increase the 
efficiency of turbines [4, 11] and pumps [12] in past. Shape design typically involves 
optimization that requires a significant number of function evaluations to explore 
different concepts. If the cost of evaluating a single design is high, as is usually the case, 
surrogate models of objectives and constraints are frequently used to reduce the 
computational burden. 

Surrogate based optimization approach is widely used in the design of space 
propulsion systems, such as radial turbines [1, 2], supersonic turbines [4], diffusers [13], 
rocket injectors [14, 15], and combustion chamber geometry [16], Detailed reviews on 
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surrogate based optimization are provided by Li and Padula [17] and Queipo et al. [18]. 
There are many surrogate models, and it is not clear which surrogate model performs the 
best for any particular problem. In such a scenario, one possible approach to account for 
uncertainties in predictions is to use an ensemble of surrogates (Goel et al. [19]). This 
multiple surrogate approach has been demonstrated to work well for several problems 
including system identification (Goel et al. [20]), compressor blade design (Samad et al. 
[ 12 ]). 

Specifically, the objectives of the present study are: (i) to improve the hydrodynamic 
performance of the diffuser via shape optimization of vanes, (ii) to identify important 
regions in diffuser vanes that help optimize pressure ratio, and (iii) to demonstrate the 
application of multiple surrogates based strategy for space propulsion systems. 

The paper is organized as follows. We define the geometry of the vanes, and 
numerical tools to evaluate the diffuser vane shapes in Section II. We describe the 
relevant details in surrogate model based optimization in Section III. The results obtained 
for the current optimization problem and a discussion of the physics involved in flow 
over optimal vane is presented in Section IV. Finally, we conclude by recapitulating the 
major findings of the paper in Section V. 

2. Problem description 

Representative radial locations in the meanline pump flow path (not-to-scale) are 
shown in Figure 3 (Domey et al. [3]). The fluid enters from the left, and is guided to the 
unshrouded impeller via inlet guide vanes assembly. The flow from the impeller passes 
through the diffuser before being collected in the discharge collector. In this study, we 
focus our efforts on a configuration of. 15 inlet guide vanes, seven main and seven splitter 


4 



blades, and 17 diffuser vanes. The length of the diffuser vanes is also fixed according to 
the location of the collector. Our goal is to maximize the performance of the diffuser, 
characterized by the ratio of pressure at the inlet and the outlet, hereafter called as 
‘pressure ratio’. The performance of the diffuser is governed by the shape of the diffuser 
vane. 

2.1. Vane shape definition 

The description of geometry is the most important step in the shape optimization. The 
current shape of the diffuser vanes, referred as baseline design, (shown in Figure 4(A)) 
was created using meanline and geometry generation codes at NASA [21] and yields a 
pressure-ratio of 1.074. It is obvious from Figure 4(B) that the existing shape allows the 
flow to remain attached while passing on the vanes but causes significant flow separation 
while leaving the vane. The separation induces significant loss of pressure recovery that 
is the primary goal of using a diffuser. The baseline design was created subject to several 
constraints, including: i) the number of vanes was set at 17 based on the number of bolts 
used to attach the two sides of the experimental rig, ii) the vanes had to accommodate a 
3/8-inch diameter bolt, with enough excess material to allow manufacturing, and iii) the 
length of the vanes was set based on the location of the collector. These constraints 
resulted in an initial optimized design that looked quite similar to the baseline design, and 
yielded similar performance in both isolated component and full stage simulations. In an 
effort to more thoroughly explore the design space for diffusers (and allowing for future 
improvements in materials and manufacturing), the constraints resulting from the bolts 
(the number and thickness of the vanes) were relaxed. Other factors that must be 
considered in the following optimization include: i) the designs resulting from the 
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optimization techniques shown in this paper are based on diffuser-alone simulations 
(different results may be obtained if the optimizations were based on full stage 
simulations), and ii) no effort was made to determine if the proposed designs would meet 
stress and/or manufacturing requirements. 

To reduce the separation, we represent the geometry of a vane by five sections using a 
circular arc and Bezier curves as shown in Figure 5. Sections 1,3, and 4 are Bezier 
curves, and Section 2 is a circular arc. The shape of the inlet nose (between points Bi and 
B 2 ) is obtained from the existing baseline vane shape. The circular arc in Section 2 is 
described by fixing the radius r (r = 0.08), the location of the center C; (-3.9, 6.1), the 
start and end angles [21]. The arc begins at angle n + 0 +612 (point Ai) and ends at 

Ji + O -Oi 2 (point A 2 ). Here, we use 0 = 22.5° and 6 is a design variable. We provide 

the information about the coordinates and slope of the tangents at endpoints such that 
different Bezier curves (Appendix 1) are generated by varying the length of tangents. The 
points Ai and A 2 , shown by red dots in Figure 5, and the tangents to the arc, serve as the 
end points location and tangents used to define the Bezier curves in Sections 1 and 3. 
Coordinates and the slopes at points Bi and B 2 (obtained using the data of inlet nose of 
the baseline design [21]) are used to define Bezier curves in Sections 1 and 4. The Bezier 
curve in Section 1 (Figure 5) is defined using the coordinates and slope of tangents at B 2 
and A 2 . The lengths of the tangents t\ and C control the shape of the curve. While one end 
point coordinates and slope for Bezier curves in Sections 3 and 4 are known at points Ai 
and Bi, the second end point and slope are obtained by defining the location of point P 
(Py, P z ). The slope of tangent at point P is taken as five degrees more than the slope of the 
line joining points P and Bj. The additional slope is specified to avoid all the points in 
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Section 4 fall in a straight line. The values of the fixed parameters are decided based on 
the inputs from designer [21]. The lengths of tangents serve as variables to generate 
Bezier curves in Sections 1, 3 and 4. 

The ranges of the design variables, summarized in Table 1, are selected such that we 
obtain practically feasible vane geometries and the corresponding grids (discussed in next 
section). We note that present Bezier curve and circular arc based definition of diffuser 
vane shapes provides significantly different vane geometries than the baseline design, 
particularly near the outlet region. The main rationale behind this choice of vane shape is 
the relaxation of manufacturing and stress constraints, ease of parameterization and better 
control of the curvature of the vane that allows the flow to remain attached. 

2.2. Mesh generation, boundary conditions, and numerical simulation 

Performance of diffuser vane geometries is analyzed using the NASA PHANTOM 
code [22] developed at Marshall Space Flight Center to analyze turbomachinery flows. 
This 3D, unsteady, Navier-Stokes code utilizes structured, overset O- and H-grids to 
discretize and analyze the unsteady flow resulting from the relative motion of rotating 
components. The code is based on the Generalized Equations Set methodology [23] and 
utilizes a modified Baldwin-Lomax turbulence model [24]. The inviscid and viscous 
fluxes are discretized using a third-order spatially accurate Roe’s scheme, and second- 
order central differencing, respectively. The unsteady terms are modeled using a second 
order accurate scheme. 

For this problem, we solve incompressible, unsteady, non-rotating, turbulent, single 
phase, constant-material-property flow over diffuser vane. The working fluid is water. By 
taking the advantage of periodicity, only a single vane is analyzed here. A combination of 
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H- and O-grids with 13065 grid points, has been used to analyze diffuser vane shapes. A 
typical grid is shown in Figure 6. The boundary conditions imposed on the flow domain 
are as follows. Mass flux, total temperature, and flow angles (circumferential and radial) 
are specified at the inlet. Mass flux is fixed at the outlet. All solid boundaries are modeled 
as no slip, adiabatic walls, with zero normal derivative of pressure. Periodic boundary 
condition is enforced at outer boundaries. With this setup, it takes approximately 15 
minutes on a single Intel Xeon processor (2.0 GHz, 1.0 GB RAM) to simulate each 
design. 

3. Methodology 

As discussed earlier, the surrogate model based approach is suitable to reduce the 
computational cost of optimization. In this section, we discuss different steps and 
considerations in surrogate modeling and its application to the design problem(s). A 
stepwise procedure of surrogate based analysis and optimization is explained with the 
help of Figure 7. Firstly, we identify the objectives, constraints, and design variables. 
Next, we develop a procedure to evaluate different designs. In context of the current 
problem, we identified design variables, objectives, and numerical procedure to evaluate 
different designs in the previous section. To reduce the computational expense involved 
in optimization, we construct multiple surrogate models of the objectives and constraints. 
A brief description of popular surrogate models is as follows. 

3.1. Surrogate modeling 

There are many surrogate models e.g., polynomial response surface approximations, 
kriging, radial basis neural network, support vector regression etc. A detailed discussion 
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of different aspects of surrogate modeling was reviewed by Li and Padula [17] and 
Queipo et al. [18]. We give a brief description of different surrogate models as follows. 

3.1.1. Polynomial response surface approximation (. PRS [25]) 

The observed response >’(x) of a function at point x is represented as a linear 
combination of basis functions f(x) (mostly monomials are selected as basis functions) 
and coefficients /? . Error in approximation s is assumed to be uncorrelated and 

normally distributed with zero mean and er 2 variance. That is, 

y(*) = Y J Pifi(*) + e', E(s)-0 ,V(s) = a\ (I) 

i 

The polynomial response surface approximation of y(x) is, 

S’(x) = 'Z l b,f i (x), (2) 

i 

where b t is the estimated value of the coefficient associated with the f h basis function 
f(x). The coefficient vector b is obtained by minimizing the error in approximation 
( e(x) = y(x) - j)(x) ) at N s sampled design points in a least square sense as, 

b = (X T Xy l X T y, (3) 

where X is the matrix of basis functions and y is the vector of responses at N s design 
points. The quality of approximation is measures by computing the coefficient of 
multiple determination R\ defined as, 



_ N s 

where 7 = (N s ) and RMS error at sampling points is given as 

i=i 

Ns 

(N s - N p )“’ ^ (y(x i ) - j)(x ; )) 2 . For a good fit, R 2 adj should be close to 1. For more 

details on polynomial response surface approximation, refer to Myers and Montgomery 
[25]. 

3.1.2. Kriging (KRG, [26]) 

Kriging is named after the pioneering work of D.G. Krige (a South African mining 
engineer) and was formally developed by Matheron [26]. Kriging estimates the value of 
an objective function _y(x) at design point x as the sum of a linear polynomial trend model 

N v 

f and a systematic departure Z(x) representing low (large scale) and high 

/ l 

frequency (small scale) variations around the trend model. 

>(x) = Kx) = X/y;(x) + Z(x). (5) 

i 

The systematic departure components are assumed to be correlated as a function of 
distance between the locations under consideration. Gaussian function is the most 
commonly used correlation function. 

N v 

C (Z(x), Z(s), 0) = ]~j exp {—0. (x,. - s t ) 2 ) • (6) 

/= l 

The parameters /3.,6 are obtained by maximizing a likelihood function that is a 

measure of the probability of the sample data being drawn from it. We use an 
implementation by Lophaven et al. [27]. 
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3.1.3. Radial basis neural network (RBNN, [28]) 

The objective function is approximated as a weighted combination of responses from 
radial basis functions (also known as neurons). 

n rbf 

y(x)= X w ^( x )> (7) 

/= 1 

where a,(x) is the response of the i th radial basis function at design point x and w, is the 
weight associated with a,(x). Mostly Gaussian function is used for radial basis function 
a(x) as 

a = radbas(^s-x\\b);radbas(n) = e~ n . (8) 

The parameter b in the above equation is inversely related to a user defined parameter 
“spread constant” that controls the response of the radial basis function. A higher spread 
constant would cause the response of neurons to be very smooth and very high spread 
constant would result into a highly non-linear response function. Typically, spread 
constant is selected between zero and one. The number of radial basis functions (neurons) 
and associated weights are determined by satisfying the user defined error “goal” on the 
mean square error in approximation. The usual value of goal is the square of five-percent 
of the mean response. 

As discussed here, we have many surrogate models and it is unknown a priori, which 
surrogate would be most suitable for a given problem. Besides, the choice of best 
surrogate model changes with sampling density and nature of the problem [19]. It has 
been shown by Goel et al. [19] that in such scenario simultaneously using multiple 
surrogate models protects us from choosing wrong surrogates. They proposed using a 
weighted averaged surrogate that is described as follows. 
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3.1.4. PRESS-based weighted average surrogate model (PWS, [19]) 
We develop a weighted average surrogate model as, 


AL 


iw (x) = 2>^- (x) ’ 


( 9 ) 


where y^(x) is the predicted response by the weighted average of surrogate models, 


y (x) is the predicted response by the i th surrogate model and w. is the weight associated 
with the i th surrogate model at design point x. Furthermore, the sum of the weights must 


be one 


X w , =1 

V i=1 


so that if all the surrogates agree, y (x) will also be the same. 


pws 


Weights are determined as follows. 



E =Y E /AT ; B<0,cc<l, 

eng i—i i SM’ ^ ’ ’ 

i=l / 

where E, is the global data-based error measure for i th surrogate model. In this study, 
generalized mean square cross-validation error (GMSE) (leave-one-out cross validation 
or PRESS in polynomial response surface approximation terminology), defined in the 

Appendix 2, is used as global data-based error measure, by replacing E. by y jGMSE_ . 
We use a = 0.05 and (5 = -1 . The above mentioned formulation of weighting schemes is 
used with polynomial response surface approximation (PRS), kriging (KRG) and radial 
basis neural networks (RBNN) such that, 

y =w y +w r y. +w , y . (11) 

s pws prs y prs krg y krg rbnn y rbnn 
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For more details about the weighted average surrogate model, we refer the reader to Goel 
etal. [19]. 

Next, we use these surrogate models to characterize the importance of different 
variables and to identify the most and the least important variables for different 
objectives and constraints. This information is vital to understand the role of the 
variables. Also, we can fix the least important variables to reduce the dimensionality of 
the problem. We use global sensitivity analysis method proposed by Sobol [29]. 

3.2. Global sensitivity analysis (GSA, [29]) 

Sobol [29] presented a variance based, non-parametric approach to perform global 
sensitivity analysis. In this approach, the response function is decomposed into unique 
additive functions of variables and their interactions such that the mean of each additive 
function is zero. This decomposition allows the variance (V) to be computed as a sum of 
individual partial variance of each variable (V,) and partial variance of interactions (Vy) of 
different variables. The sensitivity of the response function with respect to each variable 
is assessed by comparing the sensitivity indices (Si, Sy) that is the relative magnitude of 
partial and total variance of each variable. The details of the method are as follows. 

A function /fx) of a square integrable objective as a function of a vector of 
independent uniformly distributed random input variables, x in domain [0, 1 ] is assumed. 
The function can be decomposed as the sum of functions of increasing dimensionality as 

/(*) = f 0 + Z ft (*< ) + Z 4' */) + ' ' ‘ + fn...N v (*i > x 2 » ■ •••> x n v )> ( 1 2 ) 

i i<j 

where f Q = £ . / d x . If the following condition 
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( 13 ) 


)f h .j A = 0 ’ 

0 

is imposed for k = i], i s , then the decomposition described in Equation (12) is unique. 
In context of global sensitivity analysis, the total variance denoted as V(f) can be shown 
equal to 



(14) 


where V(f) = E((f - / 0 ) 2 ), and each of the terms in Equation (14) represents the partial 

contribution or partial variance of the independent variables (V,) or set of variables to the 
total variance and provides an indication of their relative importance. The partial 
variances can be calculated using the following expressions: 


V i = V(E[f |*]), 

V ij =V(E[f\x i ,x j ])-V i -V j , (15) 

Vy^ViEiflx^XjV-Vy-V^-V^-Vi-Vj-V,, 

and so on, where V and E denote variance and expected value respectively. Note that 

E[f\ x ,\~ £ f t dx t , and V(E[f | xj) = f l 2 dx i . This formulation facilitates the 


computation of the sensitivity indices corresponding to the independent variables and set 
of variables. For example, the first and second order sensitivity indices can be computed 
as 



V, 

v{fY 


s y =^-. 

J V(f) 


(16) 


Under the independent model inputs assumption, the sum of all the sensitivity indices 
is equal to one. The first order sensitivity index for a given variable represents the main 
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effect of the variable, but it does not take into account the effect of the interaction of the 
variables. The total contribution of a variable to the total variance is given as the sum of 
all the interactions and the main effect of the variable. The total sensitivity index of a 
variable is then defined as 


g total 


jj*l j,j*i k,k& 

V(f) 


(17) 


Note that the above referenced expressions can be easily evaluated using surrogate 
models of the objective functions. Sobol [29] has proposed a variance-based non- 
parametric approach to estimate the global sensitivity for any combination of design 
variables using Monte Carlo methods. To calculate the total sensitivity of any design 
variable x„ the design variable set is divided into two complementary subsets of x, and Z 
(Z - = l,N v ;j*i^. The purpose of using these subsets is to isolate the influence of 

Xi from the influence of the remaining design variables included in Z. The total sensitivity 
index for x, is then defined as 



(18) 


where 

V! ot<a = V i +V iz , (19) 

where V, is the partial variance of the objective with respect to x,- and V it z is the measure 
of the objective variance that is dependent on interactions between x, and Z. Similarly, the 
partial variance for Z can be defined as V z . Therefore the total objective variability can be 
written as 
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( 20 ) 


v=v i+ v z+ v uz . 

While Sobol [29] had used Monte Carlo simulations to conduct the global sensitivity 
analysis, we use Gauss-quadrature numerical integration of different partial variance 
terms in Equation (15) to calculate sensitivity indices. 

Surrogate models, which represent objectives and constraints, are used to optimize 
the performance of the system. If we use multiple surrogates, we will find more than one 
candidate optimal solutions. The performance of such predicted optimal designs is 
validated using numerical simulation. If we are satisfied with the performance of the 
subsystem, we terminate the search procedure. Otherwise, we refine the design space in 
the region of interest. The design space refinement can be done in multiple ways, (i) we 
can fix the least important variables at the optimal values (as realized by optimization) or 
mean values to reduce the dimensionality of the problem, (ii) we sample more points in 
the design space, and (iii) we identify the region where we expect potential improvement 
and concentrate on that region. We repeat this optimization procedure till convergence. 

4. Results and discussion 

We present application of the above-mentioned surrogate based optimization 
framework to optimize the performance of diffuser vanes in this section. We also present 
a detailed analysis of optimal design. 

4.1. Surrogate model construction 

First step in surrogate modeling is to sample data in design variable space. Different 
design of experiment (DOE) techniques are effective in reducing the computational 
expense of generating high-fidelity surrogate models. The most popular methods DOE 
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methods are Latin hypercube sampling (LHS) and face-centered central composite 
designs (FCCD). The choice of DOE depends on the nature of problem and the number 
of samples required. 

For this problem, we have nine design variables hence we selected 110 design points 
to allow adequate data to evaluate 55 coefficients of a quadratic polynomial response 
surface. Since face-centered central composite design requires unreasonably large 
number of samples (531 points to approximate 55 coefficients), we use Latin hypercube 
sampling (LHS) to construct surrogate models. We generated LHS designs using Matlab 
routine ‘ Ihsdesign ’ with 100 iterations for maximize the minimum distance between 
points. We evaluated each diffuser vane shape using PHANTOM. This dataset is referred 
as ‘Set A’. The range of the data, given in Table 2, indicated potential of improvement in 
the performance of the diffuser by shape optimization. We constructed four surrogate 
models, polynomial response surface approximation (PRS), kriging (KRG), radial basis 
neural network (RBNN), and PRESS-based weighted average surrogate (PWS) of the 
objective pressure ratio. We used quadratic polynomial for PRS, and linear trend model 
with Gaussian correlation function for kriging. For RBNN, the spread coefficient was 
taken as 0.5 and the error goal was the square of 5% of the mean value of the response at 
data points. The parameters a, and fi for PWS model were 0.05 and -1, respectively. The 
summary of quality indicators for different surrogate models is given in Table 2. 

All error indicators are desired to be low compared to the response data, except R 2 , 

which is desired to be close to one. The PRESS (Appendix 2) and RMS error (~1.0e-2) 
were very high compared to the range of data. This indicated that all surrogate models 
poorly approximated the actual response, and were likely to yield inaccurate results if 
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used for global sensitivity analysis and optimization. To identify the cause of poor 
surrogate modeling, we conducted a lack-of-fit test (refer to Appendix 3) for PRS. A low 
p-value (-0.01 7) indicated that the chosen order of the polynomial was inadequate in the 
selected design space. Since, the data available at 1 10 points is insufficient to estimate 
220 coefficients in a cubic polynomial, this issue of model inadequacy also reflected lack 
of data. 

4.2. Design space refinement 

We addressed the issue of model accuracy or data inadequacy using two parallel 
approaches. Firstly, we added more data in the design space to improve the quality of fit. 
We sampled 330 additional points using Latin hypercube sampling such that we had 440 
design points to fit a cubic polynomial (220 coefficients). We call this dataset as ‘Set B’. 
Secondly, the low mean value of the response data (1.041) at 110 points compared to the 
baseline design (1.074) indicated that large portion of the current design space was 
undesirable due to inferior performance. Hence, it might be appropriate to identify the 
region where we expect improvements in the performance of the designs, and construct 
surrogate models by sampling additional design points in that region (reasonable design 
space approach [31]). To identify the region of interest, we used the surrogate models, 
constructed with Set A data (110 design points), to evaluate response at a large number of 
points in design space. Specifically, we evaluated responses at a grid of four Gaussian 
points in each direction (total 4 9 =262,144 points). We chose Gaussian points instead of 
usual uniform grids because Gaussian points lie inside the design domain, and are less 
susceptible to extrapolation errors than comers of uniform grids that might fall outside 
the convex hull of LHS design points used to construct surrogate models. Any point with 
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a predicted performance (due to any surrogate model) of 1.08 units , or better (0.5% 
improvement over the baseline design) was considered to belong to the potential region 
of interest. This process identified 29,681 unique points (~1 1%) in the potential good 
region. We selected 110 points from this data set using D-optimality criterion in this 
smaller region. D-optimal designs were generated using Matlab routine ‘ candexcK with a 
maximum of 100 iterations to maximize D-efficiency ([25], pp. 93). This 110 points 
dataset is called ‘Set C’. 

As before, we conducted simulations at data points in the Sets B and C using 
PHANTOM. One point in each set failed to provide an appropriate mesh. The mean, 
minimum, and maximum values of the pressure ratio for the two datasets are summarized 
in Table 3. We observed only minor differences in the mean pressure-ratio of the Set B 
compared to the Set A (Table 2) but the responses in the Set C data clearly indicated high 
potential of improvement. This demonstrates the effectiveness of reasonable design space 
approach used to identify the region of interest. 

We approximated the data in the Set B and the Set C using four surrogates We 
employed a reduced cubic, and a reduced quadratic polynomial for PRS approximation of 
the Set B and the Set C data, respectively. As can be seen from different error measures 
in Table 3, the quality of surrogate models fitted to the Set B and the Set C data was 
significantly better than the surrogate models fitted to the Set A data. This improvement 
in surrogate approximation was attributed to the increase in the sampling density (Set B) 
allowing a cubic model and the reduction of the design space (Set C). Both PRS models 
did not fail the lack-of-fit test (p-value ~0.90+) indicating the adequacy of fitting 
surrogate model in the respective design spaces. The PRESS metric and weights 
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associated with different surrogates suggested that the PRS was the best surrogate model 
for the Set B and the Set C data, unlike kriging for the Set A data. 

We used the surrogate models fit to the Set B data for global sensitivity analysis and 
the surrogate models fitted to the Set C data for optimization. The optimization of the 
performance using the surrogate models fitted to the Set B data was inferior to the 
optimal design obtained using the Set C data based surrogates. Some of the optimal 
designs from the Set B data based surrogates could not be analyzed. This anomaly arose 
because large design space was sampled with limited data such that large regions remain 
unsampled; and hence, susceptible to significant errors, particularly near the comers 
where optima were found. The same issue restricted the use of surrogate models 
constructed using Set C data for global sensitivity analysis as there is large extrapolation 
error outside the region of interest where no point was sampled. Hence, surrogate models 
constructed using Set B data were more suited for conducting global sensitivity analysis. 

4.3. Global sensitivity assessment 

We used global sensitivity analysis (GSA) to identify the most and the least important 
design variables. We used Gauss quadrature numerical integration scheme with four 
Gauss points along each direction (total 4 9 =262, 144 points) to evaluate different integrals 
in GSA. The response at each point was evaluated using surrogate models fit to the Set B 
data. Corresponding sensitivity indices of main effects of different variables are shown in 
Figure 8. Though there are differences in the exact magnitude of sensitivity indices from 
various surrogates, all surrogates indicated that the pressure ratio was most influenced by 
three variables, P z , t 2 , and P y . A comparison of sensitivity indices of total and main effect 
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of design variables (using PWS) in Figure 9 suggested that the interactions between 
variables are small but non-trivial. 

4.4. Validation of global sensitivity analysis results 

To validate the findings of the global sensitivity analysis, we evaluated the variation 
in the response function (pressure ratio) by varying one variable at a time while keeping 
remaining variables at the mean values. We specified five equi-spaced levels for each 
design variable and used trapezoidal rule to compute the actual variance. The responses at 
design points were evaluated by performing actual numerical simulations. The results of 
actual variance computations are shown in Figure 10. 

The one-dimensional variance computation results also indicated that variables P z , t 2 , 
and P y were more important than all other variables. This validated the findings of the 
global sensitivity analysis. The differences in the results of one-dimensional variance 
computation and global sensitivity analysis can be explained as follows: (i) the number of 
points used to compute one-dimensional variance is small, (ii) one-dimensional variance 
computation does not account for interactions between variables, and (iii) there are 
approximation errors in using surrogate models for global sensitivity analysis. 
Nevertheless, the main implications of the finding clearly show that the performance of 
the diffuser vane was pre-dominantly affected by the location of point P and the length of 
tangent in Section 1 (Figure 5). 

4.5. Preliminary optimization of diffuser vane performance 

Next, we used surrogate models, fit to the Set C data, to maximize the pressure-ratio 
by exploring different diffuser vane shapes. To avoid the danger of large extrapolation 
errors in the unsampled region, we employed the surrogate model (PRS surrogate) from 
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the Set A as a constraint (all points in the feasible region have predicted response greater 
than a threshold value of 1 .08). Our use of PRS from the Set A as the constraint was 
motivated by the simplicity of the constraint function, and the fact that PRS contributed 
to the most number of points in the potential region of interest. 

We used sequential quadratic programming optimizer to find the optimal shapes. The 
optimal configuration of blade shapes obtained using different surrogate models as 
function evaluators is shown in Figure 11, and the corresponding optimal design 
variables are given in Table 4. The optimal designs obtained from all surrogates were 
close to each other in both function and design space. A few minor differences were 
observed in relatively insignificant design variables (refer to the results of global 
sensitivity analysis). Notably all design variables touched the bounds for PRS and were 
close to the corner for other surrogate models. The small value of 0 indicated sharper 
nose, as was used in the baseline design. Also, most tangents were at their lower bounds 
that resulted in low curvature sections. Near the point P, the tangents were on their upper 
limits to facilitate gradual transition in the slope. The optimal vane was thinner in the 
middle section and was longer compared to the baseline design (Figure 11). The central 
region of the optimal design was non-convex compared to the convex section for the 
baseline design. 

We simulated all four candidate optimal designs from different surrogate models to 
evaluate the improvements. The actual and predicted performances from different 
surrogates are compared in Table 5. We observed that the error in approximation for 
different surrogates was comparable to their respective PRESS errors. Nevertheless, PRS 
was the most accurate surrogate model and furnished the best performance shape. RBNN 
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was the worst surrogate model. PRESS-based weighted average surrogate model 
performed significantly better than the worst surrogate. The best predicted diffuser vane 
yielded significant improvements in the performance (1.117) compared to the baseline 
design (1.074). We refer to this design as ‘intermediate optimal’ design. 

It is obvious that the design space refinement in the region of interest based on 
multiple surrogates has pay-offs in the improved performance of surrogates and the 
identification of optimal design. The high confidence in the optimal predictions was also 
derived from the similar performance of all surrogate models. The results also showed the 
incentives (protection against the worst design, proper identification of the reasonable 
design space) of investing a small amount of computational resources in constructing 
multiple surrogate models (less than the cost of a single simulation) for computationally 
expensive problems, and then the extra cost of evaluating multiple optima. 

We compared instantaneous (at the end of simulation) and time-averaged flow fields 
from the intermediate optimal design (from PRS) with the baseline design in Figure 12. 
The intermediate optimal design allowed smoother turning of the flow compared to the 
baseline design, and reduced the losses due to separation of the flow, which were 
significantly high in the baseline design. Consequently, the pressure at the outlet was 
higher for this intermediate optimal design. We also noted the increase in pressure on the 
vane for the intermediate optimal design. 

4.6. Design space refinement via dimensionality reduction 

We used first design space refinement by identifying the region of interest. This 
helped in identifying the intermediate optimal design. Since most design variables in the 
optimal design were at the boundary of the design space (Table 4), further improvements 
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in the performance of the diffuser might be obtained by refining the design space. To 
reduce the computational expense, we reduce the design space by utilizing the findings of 
the global sensitivity analysis. We fixed six relatively insignificant design variables at the 
optimal design (predicted using PRS), and expanded the range of the three most 
important design variables. The modified ranges of the design variables and the fixed 
parameters are given in Table 6. We selected 20 design points using a combination of 
face-centered central composite design (15 points FCCD), and LHS designs (5 points) of 
experiments. The range of pressure ratio at 20 design points is given in Table 7. Note that 
all the tested designs in the refined design space perform better than the intermediate 
optimal design. 

As before, we constructed four surrogate models in the refined design space. The 
performance metrics, specified in Table 7, indicated that all surrogate models 
approximated the response function very well. The weights associated with different 
surrogate models, suggested that a quadratic PRS approximation represents the data the 
best. This result is not unexpected since any smooth function can be represented by a 
second order polynomial if the domain of application is small enough. As before, PWS 
model was comparable to the best surrogate model. 

4. 7. Final optimization 

The design variables for the four optimal designs of the diffuser vane obtained using 
different surrogate models and corresponding surrogate-predicted and actual (CFD 
simulation) pressure ratio are listed in Table 8. The error in predictions of surrogate 
models compared well with the quality indicators and all surrogate models had only 
minor differences in the performance. Also, the optimal vane shapes from different 
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surrogates were similar. In this case also, polynomial response surface approximation 
conceded the smallest errors in prediction. While the performance of the optimal diffuser 
vane has improved compared to the intermediate optimal design (compare to Table 5), 
the contribution of the optimization process was insignificant. One of the data points = 
0.60, P y = -2.00, P z = 6.00 ) resulted in a better performance (P -ratio = 1.151 ) than the 
predicted optimal. This result was not surprising because the optimal design existed at a 
comer that was already sampled leaving little scope for further improvement. 
Nevertheless, the optimizers correctly concentrated on the best region. As expected, the 
optimized design is thinner and streamlined to further reduce the losses, and to improve 
the pressure recovery. We analyzed the optimal diffuser vane shape according to the flow 
structure and the other considerations as follows. 

4.8. Flow structure 

The instantaneous and time-averaged pressure contours for the optimal vane shape 
(best data point) are shown in Figure 13. We observed further reduction in separation 
losses, and smoother turning of the flow compared to the intermediate optimal design 
obtained before design space refinement (Table 5). Consequently, the pressure rise in the 
diffuser was higher. The optimal vane shape had a notable curvature in the middle section 
on the lower side of the vane (near the point P). This curvature decelerated the flow and 
led to faster increase in pressure (notice the shift of higher pressure region towards the 
inlet in Figure 12 and Figure 13). 

4.9. Vane loadings 

The shapes and pressure loads on the baseline, intermediate optimal, and final optimal 
vanes are shown in Figure 14. We noted the increase in the mean pressure on the diffuser 
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vane for the optimal design. The pressure loads near the inlet tip, and the pressure loading 
on the diffuser vane, given by the area bound by the pressure profile on the two sides, had 
reduced by optimization. However, the optimized diffuser vane might be susceptible to 
high stresses as the optimal design was thinner compared to the baseline vane. The 
intermediate optimal design served as a compromise design with relatively higher 
pressure ratio (~4%) compared to the baseline design and lower pressure loading on the 
vanes compared to the optimal design. 

In the future, this problem would be studied by accounting for manufacturing and 
structural considerations like, stresses in the diffuser vanes. One can either specify a 
constraint to limit stress to be less than the feasible value or alternatively, one can solve a 
multi-objective optimization problem with two competing objectives, minimization of 
stress or pressure loading in the vane, and maximization of pressure ratio. 

4.10. Empirical considerations 

Typically, the vane shape design is carried out using empirical considerations on the 
gaps between adjacent diffuser vanes as shown in Figure 15. The empirical suggestions 
on the ratio of different gaps [32] and actual values obtained for different vanes are given 
in Table 9. Contrary to the empirical relations, the ratio of length to width gap (L/Wi), 
and ratio of width gaps (W 2 /W 1 ) decreased as the pressure ratio increases, though the 
actual magnitude of length and width of the gaps increase. The discrepancies between the 
optimal design and the empirical optimal ratios [32] are explained by multiple design 
considerations used for the empirical optimal. Firstly, we note that the optimization was 
carried out for a 2D diffuser vane not for the combination of vanes and flow for which 
empirical ratios are provided. This allowed a variable height of the vane for optimization. 
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However, the empirical ratios are obtained by assuming a constant channel height so that 
the area ratio of the channel reduces to the ratio of width gaps (W2/W1). Nevertheless, 
this requires further investigation to understand the cause of discrepancies between 
empirical ratios and that obtained for optimal design. 

5. Summary 

We used surrogate model based optimization strategy to maximize the hydrodynamic 
performance of a diffuser, characterized by the increase in pressure ratio, by modifying 
the shape of the vanes. The shape of the diffuser vanes was defined by a combination of 
Bezier curves and a circular arc. Firstly, we defined the shape of the vane using nine 
design variables and used surrogate models to represent the pressure ratio. We used lack- 
of-fit test to identify the issues of model inadequacy, and insufficiency of the data to 
represent the pressure ratio. We addressed these issues by, (i) adding more data points, 
and (ii) identifying the region of interest using the less-accurate surrogate models. More 
samples were added in the region of interest using the information from multiple 
surrogate models. 

The surrogate models, constructed with increased data and/or in smaller design space, 
were significantly more accurate than the initial surrogate models. Also, during the 
course of design space refinement, the best surrogate model changed from kriging (initial 
data) to polynomial response surface approximation (all subsequent results). Had we 
followed the conventional approach of identifying the best surrogate model with the first 
design of experiments, and then using that surrogate model for optimization, we might 
have not captured the best design. Thus, we can say that the results reflect the 
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improvements in the performance using the design space refinement approach, and using 
multiple surrogates constructed by incurring a low computational cost. 

We conducted a surrogate model based sensitivity analysis to identify the most 
important design variables in the entire design space. Three design variables controlling 
the shape of the upper and lower side of the vane were found to be most influential. We 
used surrogate model in the reduced design space to identify the optimal design in nine 
variable design space. This intermediate optimal design improved the pressure ratio by 
more than four percent compared to the baseline design. 

Since all the design variables for intermediate optimal design hit the bounds, we 
further refined the design space by fixing the least important variables on optimal values 
to reduce the design space, and relaxing the bounds on the most important design 
variables. The optimal design obtained using the surrogate models in the refined design 
space further improved the performance of the diffuser by more than seven percent 
compared to the baseline design. The pressure losses in the flow were reduced, and a 
more uniform pressure increase on the vane was obtained. However, the optimal vane 
shape might- be susceptible to fail due to high stresses. This behavior was attributed to the 
absence of stress constraint that allowed using thinner vanes to maximize the 
performance. In the future, the optimization would be carried out by considering the 
multi-disciplinary analysis accounting for stress constraint, manufacturability, and 
pressure increase. 

In terms of the vane shapes, thin vanes helped improve the hydrodynamic 
performance significantly. The interesting aspect was the change in the sign of curvature 


28 



of the vane on the suction side that allows an initial speeding of the flow followed by a 
continuous pressure recovery without flow separation. 
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Appendix 1: Bezier Curve 

A typical parametric Bezier curve f(x) x shown in Figure 16, is defined with the help of 
two end points Po and Pi, and two control points P 2 and P 3 as follows. 

fix) = P (I - x)' +3Px(l -xf +3P 2 x 2 (l-x) + P 3 x 3 ,xe [0,11. (21) 

The co-ordinate of any point on the Bezier curve is obtained by substituting the value 
of x accordingly. In this study, the location of control points is obtained by' using the 
information about the slope and the length of the tangents at end points (Papila et al. [4]). 

Appendix 2: Generalized Mean Square Cross-Validation Error (GMSE or PRESS) 

In general, the data is divided into k subsets (A- to Id cross-validation) of 

approximately equal size. A surrogate model is constructed k times, each time leaving out 
one of the subsets from training, and using the omitted subset to compute the error 
measure of interest. The generalization error estimate is computed using the k error 
measures obtained (e.g., average). If k equals the sample size, this approach is called 
leave-one-out cross-validation (also known as PRESS in the polynomial response surface 
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approximation terminology). Equation (22) represents a leave-one-out calculation when 
the generalization error is described by the mean square error (GMSE). 

GMSE = jf i (y,-y<-y, (22) 

K i = 1 

where yy’ represents the prediction at x (,) using the surrogate constructed using all 

sample points except (x (0 , y, ). Analytical expressions are available for that case for the 
GMSE without actually performing the repeated construction of the surrogates for both 
polynomial response surface approximation (Myers and Montgomery [25], Section 2.7) 
and Kriging (Martin and Simpson [30]) however here we used brute-force. The 
advantage of cross-validation is that it provides nearly unbiased estimate of the 
generalization error and the corresponding variance is reduced (when compared to split- 
sample) considering that every point gets to be in a test set once, and in a training set k - 1 
times (regardless of how the data is divided). 

Appendix 3: Lack-of-Fit Test With Non-replicate Data for Polynomial Response 
Surface Approximation 

A standard lack-of-fit test is a statistical tool to determine the influence of bias error 
(order of polynomial) on the predictions [25]. The test compares the estimated 
magnitudes of the error variance and the residuals unaccounted for by the fitted model. 
Lets say, we have M unique locations of the data and at j th location, we repeat the 
experiment rij times, such that total number of points used to construct surrogate model is 

M 

N = V n . The sum of squares due to pure error is given by, 

y=> 

®.=£i:bv-- p T < 23 > 

7=1 k=l 
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where y is the mean response at the j th sample location, given as, y = — V y . 

J J vi k 

H j k==1 

The sum of square of residuals due to lack-of-fit of the polynomial response surface 
model is, 


u , 

'- v ) • (24) 

7=1 

where j)(x ) is the predicted response at the sampled location x^ . In matrix form, the 
above expressions are given as. 


M 


ss 


P e 


-z 

7=1 1 



1 1 T 

n. n . 
j j 



(25) 


M ( 


ss. 


lof 


=y r ('„ 

v ’ 7=1 1 


1 T A 

— 1 1 T 


(26) 


where 1 is the ( « x 1 ) vector of ones, I is (n x n ) identity matrix, L, is 

n j 7 y rtj v 7 7 N s 

( x ) identity matrix. 

We formulate F-ratio using the two residual sum of squares as, 


F = 


1SS Id V 

\ P*l pe ) 


(27) 


where d = N -N , and d =N —M , are the degrees of freedom associated with 

lof s p pe s 7 ° 

SS hf and SS pe , respectively. The lack-of-fit in the surrogate model is detected with a- 
level of significance, if the value of F in Equation (27), exceeds the tabulated F 

a ’ d lof’ d pe 

value, where the latter quantity is the upper 100a percentile of the central F-distribution. 
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When the data is obtained from the numerical simulations, the replication of 
simulations does not provide an estimate of noise ( SS pe ), since all replications return 
exactly the same value. In such scenario, the variance of noise can be estimated by 
treating the observations at neighboring designs as “near” replicates ([33], pp. 123). We 
adopt the method proposed by Neill and Johnson [34], and Papila [35] to estimate the 
lack-of-fit for non-replicate simulation. In this method, we denote a near-replicate design 
point Xjk (as the replicate of the j th point ) such that 

x =x.+8 , (28) 

jk J jk 

where 8 represents the disturbance vector. Then, the Gramian matrix is written as 

jk 

X = X + A,, (29) 

where X matrix is constructed using x . for near replicate points and matrix A = X -X . 

Now the estimated response at the design points (including near-replicates) is given as 
y =y-Ab, (30) 

where b is given in Equation (3). Now, we compute SS pe and SSi 0 f by replacing y , y n , 

and X in Equations (25) and (26) with y ,y n , and X . 
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Figures 
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Figure 1 A representative expander cycle used in the upper stage engine (courtesy wikipedia). 
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Figure 2 Schematic of a pump. 



Figure 3 Meanline pump flow path (IGV: Inlet guide vane). 
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Figure 4 Baseline diffuser vane shape and time-averaged flow. 



Figure 5 Definition of the geometry of the diffuser vane (refer to Table 1 for variable description). 



Figure 6 A combination of H- and O-grids to analyze diffuser vane. Body-fitted O-grids are shown in 
green and algebraic H-grid is shown in red. 
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Figure 7 Surrogate based design and optimization procedure. 



(C) Radial basis neural network (D) PRESS-based weighted average 

Figure 8 Sensitivity indices of main effect using various surrogate models (Set B). 
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Figure 9 Sensitivity indices of main and total effects of different variables using PWS (Set B). 



Figure 10 Actual partial variance of different design variables (no interactions are considered). 
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Figure 11 Baseline and optimal diffuser vane shape obtained using different surrogate models. PRS 
indicates function evaluator is polynomial response surface, KRG stands for kriging, RBNN is radial 
basis neural network, and PWS is PRESS-based weighted surrogate model as function evaluator. 
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(A) Instantaneous - intermediate optimal design (B) Time averaged - intermediate optimal design 
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(C) Instantaneous - baseline design (D) Time averaged - baseline design 


Figure 12 Comparison of instantaneous and time-averaged flow fields of intermediate optimal (PRS) 
and baseline designs. 
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(A) Instantaneous pressure (B) Time-averaged pressure 

Figure 13 Instantaneous and time-averaged pressure for the final optimal diffuser vane shape. 
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(A) Different vane shapes 



(B) Pressure loadings on the vanes 


Figure 14 Different vane shapes and corresponding pressure loadings. 



Figure 15 Gap between adjacent diffuser vanes. 



Figure 16 Parametric Bezier curve. 
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Tables 


Table 1 Design variables and corresponding ranges. Angle 0 controls the size of circular arc, ti-tg 
define the lengths of the tangents of Bezier curves and determine its curvature, and (Py, Pz) are the 
coordinates of the point P in the middle of the section. Angle 0 is given in degrees and all other 
dimensions are scaled according to the baseline design. 



Minimum 

Maximum 

0 

60 

no 

ti 

1.25 

2.50 

t2 

0.75 

1.50 

t3 

0.05 

0.30 

U 

0.50 

1.00 

ts 

0.50 

1.00 

u 

1.00 

2.00 

Py 

-2.50 

-2.00 

Pz 

5.70 

5.85 


Table 2 Summary of pressure ratio on data points and performance metrics for different surrogate 
models fitted to Set A. We tabulate the weights associated with the surrogates used to construct 
PRESS-based weighted surrogate (PWS). PRS: Polynomial response surface, KRG: Kriging, RBNN: 
Radial basis neural networks, RMSE: Root mean square error, PRESS: Predicted residual sum of 
squares. Here we give the square root of PRESS so as to facilitate each comparison with RMSE. 


Surrogate 

Parameter Value Weight 

Pressure 

ratio 

# of points 110 

Minimum of data 1.001 

Mean of data 1.041 

Maximum of data 1.093 

PRS 

Ku 0.863 

RMSE 8.00e-3 

PRESS 1.33e-2 °-32 

Max Error 2.62e-2 

Mean Absolute Error 4.40e-3 

KRG 

Process Variance 1.09e-4 

PRESS 1.02e-2 041 

RBNN 

PRESS 1.58e-2 

Max Error 2.03e-2 0.27 

Mean Absolute Error 3.33e-3 

PWS 

Max Error 1 ,39e-2 

Mean Absolute Error 1 ,95e-3 
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Table 3 Range of data, quality indicators for different surrogate models, and weights associated with 
the components of PWS. PRS: Polynomial response surface, KRG: Kriging, RBNN: Radial basis 
neural networks, PWS: PRESS-based weighted surrogate, RMSE: Root mean square error, PRESS: 
Predicted residual sum of squares (in PRS terminology). Here, we give the square root of PRESS so 
as to facilitate each comparison with RMSE. We used a reduced cubic and a reduced quadratic 
polynomial to approximate the Set B and Set C data, respectively. 




SetB 

Set C 

Surrogate 

Parameter 

Value 

Weights Value Weights 


# of points 

439 

109 


Minimum of data 

1.000 

1.052 


Mean of data 

1.040 

1.075 


Maximum of data 

1.097 

1.105 


PRS 


K,j 0.959 

RMSE 4.07e-3 

PRESS 4.84e-3 

Max Error 1.02e-2 

Mean Absolute Error 2.67e-3 


0.44 


0.978 

1.65e-3 

8.74e-3 

3.84e-3 

1.05e-3 


0.36 


KRG 


Process Variance 1 . 1 1 e-4 

PRESS 5.89e-3 


0.37 


1.05e-4 

9.47e-3 


0.34 


RBNN 


PRESS 1.17e-2 

Max Error 1.53e-2 

Mean Absolute Error 1 .47e-3 


0.19 


1.07e-2 

2.57e-2 

2.89e-3 


0.30 


PWS 


Max Error 6.85e-3 

Mean Absolute Error 1 ,30e-3 


7.91e-3 

1.04e-3 


Table 4 Optimal design variables and pressure ratio (P-ratio) obtained using different surrogates 
constructed using Set C data. PRS is polynomial response surface, KRG is Kriging, RBNN is radial 
basis neural network, and PWS is PRESS-based weighted surrogate. 


Surrogate 

Pk ; 

> 

+2? 

4^ 

4^ 

4^ 

4^ 

CD 

Predicted 

P-ratio 

PRS 

KRG 

RBNN 

PWS 


1.120 

1.111 

1.105 

1.109 
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Table 5 Comparison of actual and predicted performance of optimal designs obtained from multiple 
surrogate models (Set C). Each row shows the results of optimal design using a particular surrogate 
and different columns show the prediction of different surrogate models at each optimal design. PRS 
is polynomial response surface, KRG is kriging, RBNN is radial basis neural network, and PWS is 
PRESS-based weighted surrogate. 



Surrogate-prediction using 

Design by 

Actual P-ratio 

PRS 

KRG 

RBNN 

PWS 

PRS 

1.117 


1.103 

1.084 . 

1.103 

KRG 

1.113 

1.112 

llil 

1.085 

1.104 

RBNN 

1.106 

1.106 

1.104 

Will 

1.105 

PWS 

1.114 

1.116 

1.108 

1.102 

Sll 

Average absolute error 

0.0014 

0.0060 

0.0186 

0.0072 


Table 6 Modified ranges of design variables, and fixed parameters in refined design space. 


Variables 

Min 

Max 

Fixed 

parameters 

Value 

Fixed 

parameters 

Value 

h 

0.60 

0.75 

0 

60° 

u 

1.00 

Py 

-2.00 

-1.50 

ti 

1.25 

t 5 

1.00 

Pz 

5.85 

6.00 

t 3 

0.05 

u 

1.00 


Table 7 Range of data, summary of performance indicators, and weights associated with different 
surrogate models in the refined design space. PRS: Polynomial response surface, KRG: Kriging, 
RBNN: Radial basis neural networks, PWS: PRESS-based weighted surrogate, RMSE: Root mean 
square error, PRESS: Predicted residual sum of squares (in PRS terminology). Here we give the 
square root of PRESS so as to facilitate each comparison with RMSE. 


Surrogate 

Parameter Value Weights 


# of points 20 

Minimum of data 1.117 

Mean of data 1.136 

Maximum of data 1.151 

PRS 

R 2 adJ 0.956 

RMSE 1.75e-3 

PRESS 2.74e-3 °- 59 

Max Error 2.53e-3 

Mean Absolute Error 1 . 1 1 e-3 

KRG 

Process Variance 1.21e-4 n . 

PRESS 7.14e-3 

RBNN 

PRESS l.lle-2 

Max Error <1.0e-6 0.16 

Mean Absolute Error <1.0e-6 

PWS 

PRESS 4.21 e-3 

Max Error 1.51 e-3 

Mean Absolute Error 6.62e-4 
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Table 8 Design variables and pressure ratio at the optimal designs predicted by different surrogates. 
PRS is polynomial response surface, KRG is kriging, RBNN is radial basis neural network, and PWS 
is PRESS-based weighted surrogate. 





Surrogate predictions using 


t2 P v 

Pz 

Actual P-ratio 

PRS 

KRG 

RBNN 

PWS 

PRS 

0.60 -1.99 

6.00 

1.150 

■B 

1.151 

1.155 

1.152 

KRG 

0.60 -1.87 

6.00 

1.149 


HUg 

1.152 

1.151 

RBNN 

0.61 -1.97 

5.99 

1.150 

BI«%M 


ISil 

1.151 

PWS 

0.60 -1.96 

6.00 

1.150 

1.151 

1.152 

1.154 

MB 

Average 

absolute error 


7.5E-04 

1.50E-03 

4.25E-03 

1.75E-03 


Table 9 Actual and empirical ratios of gaps between adjacent diffuser vanes. 



Wi 

w 2 

L 

L/Wi 

w 2 /w, 

P-ratio 

Empirical relations 




4.00 

1.60 


Baseline vane 

0.66 

0.94 

2.09 

3.15 

1.42 

1.074 

Intermediate optimal 

0.79 

1.07 

2.03 

2.55 

1.35 

1.117 

Optimal vane 

0.94 

1.11 

2.02 

2.15 

1.19 

1.150 
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Introduction 


Upper-stage expander cycle employs 
radial turbines - allow higher pump 
speeds, improve efficiencies at design 
and off-design operating conditions 

Diffuser pump with vanes was found to 
be more efficient than the vaneless 
design 

The objectives of the current work are 

• to maximize the ratio of static 
pressure at inlet and outlet in a 
diffuser pump via shape optimization 
of the vanes 

• to demonstrate a multiple surrogate 
model based optimization frame- 
work for design and analysis 



Main impeller 
blade 


Inlet guide 
vane 


Diffuser 

vane 


Splitter 

blade 
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Surrogate-based optimization approach 


Define optimization 
problem 

Min f(xj, x 2 --XJ 

s.t. C(x) < 0 
H(^=0 


Evaluate _/(x), 
C(x), H(x) 


x- 


/ 

£ 


Surrogate 

models 


Identify most and 
least important 
variables using GSA 





r ■> 

f 





H 

> 




C 





Perform 

optimization 



Numerical 

simulations 
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Surrogate modeling 


Surrogate model 

• functional relationship between design 
variables and response y * f(x) 

Advantages 

• computationally inexpensive 

• easy coupling with optimization 
software 

• facilitates sensitivity analysis 
Popular surrogates 

• polynomial response surface 

• kriging 

• radial basis neural network 

• ensemble of surrogates 



Surrogate construction 
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Polynomial response surface approximation 




. . 4 - £ 


i<j = 2 


y = ^P + £, y = Xb 


b = (X'X)'x'y 


Assumption: Normally distributed 
uncorrelated noise 

In presence of noise, response surface 
may be more accurate than observed 
data 



Sampling 
data points 




Response 

surface 


+ X 
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y(\) = y(x) 


Systematic departure 


Kriging (KRG) 



c ( Z(x), Z (s), 0 ) = n ex P (-0 , (*, -s,) 2 ) 


1=1 


Trend model 


■ Assumption: Systematic departures 
Z(x) are correlated, noise is small 

■ Gaussian correlation function C(x, 
s,0) is most popular 


y 


Sampling 

data points Linear Trend 
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Radial basis neural networks (RBNN) 


N 

1 ^RBF 

y(x) = Yj 


1=1 


a - y x radbas ( s — x ) ; radbas ( n ) = 


Neurons (radial basis functions) at some of 
data points are used to approximate 
response. Other points are used to estimate 
error 

User-defined constants: 

• spread constant: radius of influence 1/b 

• error goal: minimum sum of square of errors 
Good for modeling fast-changing responses 




<*3/ Output 
Radial basis functions 
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Weighted average surrogate 


Best surrogate model for the given 
problem is unknown 

Use a weighted sum of responses 
from N sm surrogates 


N. 


SM 




Choice of weights w, should reflect 

• confidence in i th surrogate model 

• good surrogate high weight 

• protection against lucky surrogate 
(that models data well but is 
inaccurate) 



RBNN: radial basis neural network 
PWS: PRESS-based weighted surrogate 


/^UNIVERSITY OE , 

^Florida! 


n 


PRESS: predicted residual sum of squa res 
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PRESS-based weighting strategy 


Parametric weighting strategy based on error E, 


w = 


r 





\ 


-1 


+ oc 


avg 


J 



N, 


SM 


E =y E. N- 0<«<1 

avg i I SM ’ 

1=1 


Protects against data 
modeling surrogate 

Here a = 0.05 


■ We use PRESS-based weighting strategy £ = V PRESS 

■ Example of PRESS-based weighted surrogate (PWS) with 
polynomial response surface, kriging, and radial basis 
neural network 


y (x) = w y (x) + w y (x) + w y (x) 

pws v x prs prs krg krg v rbnn' rbnn 


/^UNIVERSITY OF;,! 

ll? FLORIDA > 

s __U/ 
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Global sensitivity analysis 


■ Global sensitivity analysis (Sobol, 1993) is used to assess the 
contribution of different variables to total variability - can identify 
the importance of variables 


Sensitivity index of main effect Sensitivity index of total effect 


ctotal _ Jd*i k,k*i 

' V(f) 



Variance 

Mean 


n/)-£((/-/ 0 ) ! ) 





x { dx 2 ...dx 
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Shape definition 


Shape of diffuser vane is defined 
using parametric Bezier curves and 
circular arcs 

Nine design variables 

• angle 0 

• lengths of tangent vectors t-, -t 6 

• coordinates of P: P , P z 

Performance of a diffuser vane is 
assessed using 3D unsteady N-S 
code PHANTOM with Baldwin-Lomax 
turbulence model 

Each 3D simulation takes nearly 20 
minutes - use surrogates for 
optimization to avoid high cost 
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Surrogate approximation of pressure ratio 


High errors in approximations 

Lack of fit test revealed that the 
order of polynomial is insufficier 

Not enough data to fit higher 
order polynomial 

Large region in design space is 
undesirable (P-ratio < 1.074) 

Improve surrogate quality via 
design space refinement 

PRS: Polynomial response surface 
KRG: Kriging 
RBNN: Radial basis neural 
network 

PRS: PRESS-based weighted 
surrogate 




P-ratio 

W/ 

t 

# of training points 

110 



Minimum of data 

1.001 



Mean of data 

1.041 



Maximum of data 

1.093 



Baseline design 

1.074 


PRS 

R 2 

^ adj 

0.863 



PRESS 

1.33e-2 

0.320 

KRG 

PRESS 

1.02e-2 

0.410 

RBNN 

PRESS 

1.58e-2 

0.270 

PWS 

PRESS 

1 . 1 2e-2 



Poor quality of surrogates 
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Design space refinement 


Identify the region of interest 
using existing surrogates 

• identify the region where we 
observe at least 0.5% 
improvement over baseline 

• sampled 110 points using D- 
optimality criterion 

One simulations failed 

Higher potential of improvement 

Fit surrogate models 

• reduced quadratic polynomial 
response surface 

• no lack of fit 

Quality of surrogate models 
improved 




P-ratio 

w i 


# of training points 

109 



Minimum of data 

1.052 



Mean of data 

1.075 



Maximum of data 

1.105 


PRS 

R 2 adj 

0.978 



PRESS 

8.74e-3 

0.36 

KRG 

PRESS 

9.47e-3 

0.34 

RBNN 

PRESS 

1.07e-2 

0.30 

PWS 

PRESS 

7.53e-3 
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Optimization results 


Optimization using SQP using different surrogates to evaluate objective 


Optimized variables: 



e 


*2 

*3 

t 4 

t 5 

*6 

Py 

p z 

PRS 

60.00 

1.25 

0.75 

0.05 

1.00 

1.00 

1.00 

- 2.00 

5.85 

KRG 

63.58 

1.25 

0.80 

0.05 

0.81 

0.88 

1.43 

- 2.00 

5.85 

RBNN 

64.49 

1.32 

0.78 

0.07 

0.94 

0.54 

1.08 

- 2.04 

5.84 

PWS 

63.65 

1.25 

0.75 

0.05 

1.00 

0.53 

1.09 

- 2.00 

5.85 


Optimized designs from different surrogates are in the same vicinity 

Moderately high cost of evaluating multiple optimal designs but 
significantly less than the cost of getting data 

Many variables hit bounds -> more improvement by relaxing ranges 
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Optimal vane shape 



■ y xxx : Predicted pressure ratio using polynomial response surface, kriging, radial 
basis neural network and weighted average model 

■ PRS is the most accurate surrogate and it results in the best improvement 

■ RBNN was the worst surrogate 

■ All surrogates yield similar optimal design that has a bend in the middle 

■ Significant improvements in the P-ratio (nearly 4% over baseline P ratio of 1 .074) 
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Global sensitivity analysis 




t, 

0,7% 20.3% t 3 



Main effects 


Total effects Validation 


■ Results are shown using PWS surrogate constructed with 440 points 

■ Location of point P and length of left tangent in Section 1 were found to 
be the most important variables 


■ Interactions among variables were not negligible but small 


■ Validation using actual simulations without considering interactions 
show similar trends 
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Dimensionality reduction 


Shape definition is same as used 
earlier 

We fix six design variables near 
to their optimal values as 
observed for PRS design 

Fixed variables 

• 0 = 60° 

• t, = 1.25 

• t 3 = 0.05 

• t 4 = 1 .00 

• t 5 = 1.00 

• t 6 = 1.00 

Three design variables with 
expanded range are -> 
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Optimization results 


■ Optimization using SQP when different surrogates approximate 
response 

■ Optimized variables and responses 



*2 

P v 

Pz 

Actual Y 

yprs 

ykrg 

yrbnn 

y P ws 

PRS 

0.60 

-1.99 

6.00 

1.150 

1.151 

1.151 

1.155 

1.152 

KRG 

0.60 

-1.87 

6.00 

1.149 

1.150 

1.152 

1.152 

1.151 

RBNN 

0.61 

-1.97 

5.99 

1.150 

1.150 

1.150 

1.155 

1.151 

PWS 

0.60 

-1.96 

6.00 

1.150 

1.151 

1.152 

1.154 

1.152 

Average absolute error 


7.5e-4 

1 .5e-3 

4.3e-3 

1 .7e-3 


■ All optimized designs are in the same region of design space and 
significantly better than the baseline design but a data point is better! 

■ The optimal design appeared at a corner that was already sampled 

■ PRS was the most accurate surrogate 
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Flow fields 


N 


8 


7 


6 



Flow 

direction 


RHO 

550.00 
52000 
49000 

460.00 

430.00 
40000 

370.00 
34000 


Baseline 


-4 -2 0 


Y 



8 


RHO 

550.00 

520.00 

490.00 

460.00 

430.00 
40000 

370.00 

340.00 


Optimal design 


-4 -2 0 

V* 


Less separation and recirculation in the optimal design compared to 
the baseline design 

High pressure rise near the exit for optimal design 
More smooth turning of the flow 


Optimal design reduce separation losses 
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Vane loadings 


■ High pressure load near the inlet 

■ Pressure increases with decrease 
in area in the channel and vice- 
versa 

■ Reduction in inlet tip load and 
pressure loading on the vane (area 
bound by the pressure profile) by 
optimization 

■ Optimal vane shape experiences 
highest mean pressure - 
susceptible to failure because of 
the least material 



gRk UNIVERSITY OF 

FLORIDA 
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Summary 


■ Shape optimization of diffuser vane resulted in significant 
improvements in pressure rise - nearly 7% improvement over 
baseline design 

■ The optimal vane is thinner than baseline, allows smooth turning 
of the flow from both edges, and reduces separation but is 
susceptible to stress failure, and may not accommodate 
manufacturing requirements 

■ Surrogate based optimization process helps 

• identify the most important design variables 

• identify region of high performance and the near optimal design 

• provide accurate predictions in the desired region 

■ Best surrogate model can change during optimization, hence 
using multiple surrogates is advantageous 
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Appendix 
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Shape definition 



lnl«t nos* 


Nine design variables 

• define angle 0 

• define lengths of tangent vectors t,-t 6 



Min 

Max 

e 

60 

110 

*1 

1.25 

2.50 

t 2 

0.75 

1.50 

*3 

0.05 

0.30 

t 4 

0.50 

1.00 

t 5 

0.50 

1.00 

*6 

1.00 

2.00 

| P y 

-2.50 

-2.00 

L p z 

5.70 

5.85 


• coordinates of P: P , P z 

Mesh generation using CORGRD - 13065 points 

Each simulation takes nearly 20 minutes - use surrogates for optimization 
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Global sensitivity analysis 


■ Global sensitivity analysis (Sobol, 1993) is used to assess the 
contribution of different variables to total variability - can identify 
the importance of variables 

Variance — ► r(/) = £((/-/ 0 ) 2 ) Mean -^f^[[...[f(x)dx x dx r ..dx N 


Variance 

decomposition 


r(/) = f^+ I 


v , + 


+v, 


\...N V 


i=l 


l</< j<N v 


How to get V h V ...?? 


Function /(x) = /o + ZyX*/)+Z4( x <>*/) + "‘ + ^-*,(*’* 2 >-".*iO 

decomposition 


such that 


f =0 


main effects 


interactions 



then V, = J \J?dXi 


_ j,j*i j,j*ik,k*i 


V(f) 


Sensitivity index of main effect Sensitivity index of total effect 
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Design space refinement 


■ Two ways of design space refinement 

• add 330 more samples - total 440 points (Set B) 

• use poor quality surrogates to identify the region where we may 
have improved performance (at least 0.5% improvement over 
baseline) and add samples in that region - sampled 110 points 
(Set C) 

■ Simulations at these locations - one simulation for each 
case failed 

■ Fit surrogate models to individual data sets 

• reduced cubic polynomial response surface to the Set B data 

• reduced quadratic polynomial response surface to the Set C data 
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Quality of surrogates in refined domains 




Set B 

w? 

SetC 

wf 


# of training points 

439 


109 



Minimum of data 

1.000 


1.052 



Mean of data 

1.040 


1.075 



Maximum of data 

1.097 


1.105 


PRS 

R 2 

^ adj 

0.959 


0.978 



PRESS 

4.84e-3 

0.50 

8.74e-3 

0.36 

KRG 

PRESS 

9.14e-3 

0.27 

9.47e-3 

0.34 

RBNN 

PRESS 

1.17e-2 

0.22 

1.07e-2 

0.30 

PWS 

PRESS 

6.07e-3 


7.53e-3 



■ Surrogate predictions are improved - best surrogate changed 

■ No lack of fit for both data sets 

■ Potential of improvement using the surrogates fit to the Set C data 
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Global sensitivity analysis 




Sensitivity index 


■ Global sensitivity analysis using multiple surrogate models resulted in 
same conclusions - results shown are obtained using PWS 

■ Location of point P (P y , P z ) was most important followed by variable t 2 

■ All other variables were relatively unimportant 

■ Interactions between different variables were not too important 


Location of point P and param eter t 2 are most im porta nt 
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Validation of GSA results 


Vary one variable at a time while keeping 
all other variables fixed at mean values 

Calculate variance using trapezoidal rule 
with five equi-spaced levels for each 
variable 

Actual sensitivity analysis also indicate 
three important variables - 1 2 , P y , P z 

The differences in results of GSA and 
actual computations are attributed to 

• insufficient data to cover entire design 
space 

• inaccuracies in surrogate models 

• actual variance does not account for the 
interactions 



Main effects - actual variance 


Results validate the g lo bal sensitiv i ty analysis 
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Performance of surrogates (Final) 


All data points were better 


than the intermediate 
optimal design 






# of training 
points 

20 


Reduced quadratic fit of 


Minimum of data 

1.117 


pressure ratio in modified 


Mean of data 

1.136 


design space 


Maximum of data 

1.151 


No lack of fit 


Baseline design 

1.074 


Quality of PRS is the best 

PRS 

R 2 adj 

0.956 


and RBNN is the worst 


PRESS 

2.74e-3 

0.59 

PWS is at par with the 

KRG 

PRESS 

7.14e-3 

0.25 

best surrogate 

RBNN 

PRESS 

1.11 e-2 

0.16 


PWS 

PRESS 

4.21e-3 
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